Custom Content Policies

Overview

Content policies let you define custom guardrails aligned with your organization's specific requirements. Examples of custom policies include avoiding financial advice and not mentioning a particular competitor. Input Content Policies detect non-compliant user inputs, while Output Content Policies detect non-compliant model responses.
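As a rough illustration of the input/output distinction, the sketch below uses a toy keyword matcher in place of a real policy model. The names `Policy`, `violates`, and `check`, along with the example terms, are hypothetical and are not part of Dynamo Guard's API.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Policy:
    """Hypothetical stand-in for a custom content policy."""
    name: str
    applies_to: str                 # "input" or "output"
    banned_terms: Tuple[str, ...]   # toy detector; a real policy would use a model

    def violates(self, text: str) -> bool:
        # Case-insensitive substring match against the banned terms.
        lowered = text.lower()
        return any(term in lowered for term in self.banned_terms)


def check(policies: List[Policy], stage: str, text: str) -> List[str]:
    """Return the names of the policies for this stage that the text violates."""
    return [p.name for p in policies if p.applies_to == stage and p.violates(text)]


policies = [
    Policy("Financial Advice", "input", ("should i invest", "which stock")),
    Policy("Competitor Mention", "output", ("acme corp",)),
]

# Input policies run on the user's prompt, output policies on the model's reply.
input_hits = check(policies, "input", "Which stock should I buy?")
output_hits = check(policies, "output", "You could also try Acme Corp.")
```

In this sketch an input policy is never evaluated against model responses, and vice versa, which mirrors the split described above.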

Content Policy Actions

Content policies currently enable flagging and blocking content.

Flag: allow user inputs and model outputs containing non-compliant content, but flag the input or output in the moderator view
Block: block user inputs or model outputs containing non-compliant content
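The difference between the two actions can be sketched as a small decision function. This is a minimal illustration under stated assumptions, not Dynamo Guard's implementation: `Action`, `Decision`, and `apply_action` are hypothetical names, and because the description above does not say whether blocked content also appears in the moderator view, the sketch simply leaves it unflagged.

```python
from dataclasses import dataclass
from enum import Enum


class Action(Enum):
    """Hypothetical enumeration of the two policy actions."""
    FLAG = "flag"
    BLOCK = "block"


@dataclass
class Decision:
    allowed: bool   # whether the content is passed through
    flagged: bool   # whether it is surfaced in the moderator view


def apply_action(violation: bool, action: Action) -> Decision:
    """Map a policy result and its configured action to an outcome."""
    if not violation:
        # Compliant content passes through untouched.
        return Decision(allowed=True, flagged=False)
    if action is Action.FLAG:
        # Flag: content is still delivered but marked for moderators.
        return Decision(allowed=True, flagged=True)
    # Block: content is stopped (assumption: not separately flagged).
    return Decision(allowed=False, flagged=False)
```

For example, `apply_action(True, Action.FLAG)` lets the content through while marking it for review, whereas `apply_action(True, Action.BLOCK)` stops it.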

Out-of-the-box Policy Inventory

In addition to providing tooling for custom guardrail creation, Dynamo Guard provides the following default guardrails to help your enterprise address common model safety and compliance scenarios.

Policy	Input or Output	Definition	Date Updated
Prompt Injection	Input	Detects prompt injection attacks.	07-15-2024
Legal Advice	Input	Detects user inputs requesting legal advice.	07-15-2024
Financial Advice	Input	Detects user inputs requesting financial or investment advice.	07-15-2024
Prohibit Discrimination (Coming Soon)	Input	Prohibits prompts that are discriminatory toward any individual or group of individuals.	Coming Soon
Material Non-Public Information (Coming Soon)	Input	Prohibits prompts that include Material Non-Public Information.	Coming Soon
Compensation Data (Coming Soon)	Input	Prohibits prompts that request or provide sensitive compensation data.	Coming Soon
